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ABSTRACT 

We present a new catalogue of 55 121 groups and clusters centred on Luminous Red 
Galaxies from SDSS DR7 in the redshift range 0.15 ^ z ^ 0.4. We provide halo 
mass (M 5 oo) estimates for each of these groups derived from a calibration between the 
4^ ■ optical richness of bright galaxies (M r ^ —20.5) within 1 Mpc, and X-ray-derived mass 

for a small subset of 129 groups and clusters with X-ray measurements. For 20 157 
high-mass groups and clusters with M500 > 10 13,7 M , we find that the catalogue 
has a purity of > 97 per cent and a completeness of ~ 90 per cent. We derive the 
mean (stacked) surface number density profiles of galaxies as a function of total halo 
mass in different mass bins. We find that derived profiles can be well-described by a 
projected NFW profile with a concentration parameter ((c) = (r2oo/r s ) ~ 2.6) that is 
approximately a factor of two lower than that of the dark matter (as predicted by N- 
body cosmological simulations) and nearly independent of halo mass. Interestingly, in 
spite of the difference in shape between the galaxy and dark matter radial distributions, 
both exhibit a high degree of self-similarity. We also stack the satellite profiles based 
on other observables, namely redshift, BCG luminosity, and satellite luminosity and 
colour. We see no evidence for strong variation in profile shape with redshift over the 
range we probe or with BCG luminosity (or BCG luminosity fraction), but we do find 
a strong dependence on satellite luminosity and colours, in agreement with previous 
studies. A self-consistent comparison to several recent semi- analytic models of galaxy 
formation indicates that: (1) beyond ~ 0.3r5oo current models are able to reproduce 
both the shape and normalisation of the satellite profiles; and (2) within « 0.3r5oo 
the predicted profiles are sensitive to the details of the satellite-BCG merger timescale 
calculation. The former is a direct result of the models being tuned to match the global 
galaxy luminosity function combined with the assumption that the satellite galaxies do 
not suffer significant tidal stripping, even though their surrounding dark matter haloes 
can be removed through this process. Combining our results with measurements of the 
intracluster light should provide a way to inform theoretical models on the efficacy of 
the tidal stripping and merging processes. 
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1 INTRODUCTION 

It has been recognized for some time now that the abun- 
dance and radial distribution of satellite galaxies within 
larger host systems can potentially provide strong tests of 
our current cosmological paradigm for structure formation. 
Perhaps the most well-known example of this is the so-called 
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'missing satellites problem' of cold dark matter (CDM) 
models, which refers to the apparent inconsistency between 
the CDM predictions and the Local Gro up satellite census . 
As pointed out by several groups (e.g. Mo ore et al.l l 1999: 
iKlvpin et ai1ll99si l. the count of the Milky Way and M31 
satellites is lagging behind the cumulative sub-halo mass 
function by several orders of magnitude. The solution to this 
problem may simply be that galaxy formation becomes very 
inefficient at low halo masses (perhaps due to feedback pro- 
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cesse s such as photoionisation and winds f r om supernovae; 
e.g.. iBullock et all l2000l: ISomervill e 2002; Kravtsov et aU 



Koposov et al] 12009). or it may signal a more funda- 



120041 : 

mental pr oblem with the underlying CDM theoretical frame- 
work (IKa mionkowski & Liddle 2000; Soergel &; Steinhardtl 
l2000l : lBode et al.ll200ll : IZentner Bullockll2003h .~ 

The abundance and distribution of satellite galaxies in 
more massive galaxy groups and clusters (hereafter collec- 
tively referred to as clusters) provides astronomers with 
another valuable check of our current structure formation 



paradigm. Furthermore , as can be readily shown with simple 
'abundance matching' (iVale fc O striker 2004: IConrov et al. 
2006; Shankar et al. 2006; Conrov et al.ll2007l : iBaidrv et al. 
2008; Moster et all l2010l : iGuo et al.1 l2010h argument^], a 



sizeable fraction of the overall galaxy population resides in 
(i.e., are satellites of) clusters. The processes that influence 
the abundance and distribution of satellites in clusters must 
therefore be properly taken into consideration in the de- 
velopment of a general theory for galaxy formation. These 
processes can be constrained by directly confronting obser- 
vations of the radial distribution of satellites with theoretical 
predictions. Finally, the formation and evolution of the most 
massive galaxies in the universe, i.e., the central brightest 
cluster galaxies (BCGs), as well as that of the diffuse intra- 
cluster lightEl (ICL), are currently believed to be intimately 
linked (via mergers and tidal disruption) to the satellite 
galaxy population. A detailed understanding of BCGs and 
the ICL therefore necessarily includes an understanding of 
the satellite population. 

The advent of large optical surveys and near-IR sur- 
veys has allowed for more detailed characterisation of radial 
profiles of satel li tes in clusters t han was previously p o ssible . 
ICarlberg et al] (Il997|. se e also Ivan der Marel et al.l [2000) 
and iMuzzin et al] (|2007T ) measured the r-band and K-band 
(respectively) satellite number density profiles of 15 X-ray- 
selected galaxy clusters (z ~ 0.3) in the CNOC1 survey. 
Both studies concluded the radial profiles were consistent 
with relatively cuspy distributions (in the inner regions) but 
with a concentration (c = r2oo/r s ~ 4, where r2oo is the ra- 
dius that encloses a mean density of 200 times the critical 
density of Universe and r s is the fitted scale radius) that 
is somewhat lower than that of the underlying dark mat- 
ter distribution predicted from dissipationless cosmological 
simul ations. Using th e satellites themselves as tracers of the 
mass, ICarlberg et al] performed a Jeans analaysis to mea- 
sure the total mass-to-light ratio as a function of radius 
and indeed found that beyond approximately 0.3r2oo the 
satellites / light traced the underlying mass distribution well. 
iLin et al I (|2004h stacked a sample of 93 local (z « 0.05) 
X-ray-selected galaxy clusters observed with 2MASS to de- 
rive the mean projected K-band satellite number density 
profile. C onsistent with the previously mentioned studies, 
Lin et al. (2004) found the stacked satellite number density 
profile could be well fit by a NFW distribution, but with 
a somewhat lower concentration still, of c ~ 2.9. The dif- 
ference in the derived concentrations may reflect differences 



1 A procedure to link galaxies to their DM haloes by matching 
their observed stellar mass functions to simulated halo mass func- 
tions. 

2 Which, by some estimates, may cont ain as much as 5 0% of the 
integrated stellar mass of clusters fe.g. iGonzalez et alj|2007h . 



in the way r2oo was estimated (jCar berg et al. use velocity- 
dispersion based method, whereas Lin et all used a X-ray 
temperature-mass scaling relation) , differences sample selec- 
tion, and differences in the mean redshifts of the samples. 
Due to the rela tively small sample siz es an d the limited mass 
range studied, Carlberg et a D (|1997T ) and ILin et al.1 (|2004l ) 
were unable to explore any dependencies the radial profiles 

may have on host system mass. 

Somewhat more recently, lHansen et al] (|2005h used the 
original maxBCG cluster sample derived from the SDSS 
to investigate the spatial distribution of satellites. The 
maxBCG algorithm finds overdensities of galaxies which 
have colours that place them on the red sequence of galaxies 
and which have a BCG that has colours that are compatible 
with the red sequence. The large survey area affored by the 
SDSS meant lHansen et al] could study a much larger sam- 
ple of clusters than was previously possible. They studied 
a sample of 6708 clusters in total and derived the stacked 
number density profiles of satellites in severa l bins of rich- 
ness. Consistent with the previous findings, [Hansen et al] 
found relatively cuspy distributions of the satellites. Inter- 
estingly, the inferred concentrations were much lower (they 
found c ~ 1) than that inferred previously for high mass 
clusters and, furthermore, found a relatively steep trend in 
concentration wit h cluster richn ess. However, it is impor- 
tant to note that lHansen et al] adopted a different defini- 
tion for the concentration than the previously mentioned 
studies. In particular, lacking a robust total halo mass esti- 
mate of the clusters in their sample, they adopted a more 
observationally-motivated definition for r2oo (which they la- 
bel as ^200)5 which they define as the radius at which the 
space density of galaxies is overdense by a factor of 200. The 
advantage of this choice is that the characteristic radius can 
be measured directly from the data. The disadvantage is 
that it complicates comparisons to theoretical models and 
simulations. 

On the theory side, the radial distribution of dark mat- 



(e.g.. ISDringel et alJ l200ll 


Gao et al. 


120041; Sales et^alj boOjj; 


Shaw et al. 



120071 : lAngulo et al.1 12009|; iKlypin et al.1 120101 ). In general, 
these studies find that at intermediate/large cluster-centric 
radii (r > 0.3 — 0.4r2oo) the dark matter subhaloes trace 
that of the underlying main halo. At smaller radii, how- 
ever, the distribution of subhaloes flattens significantly (un- 
like observed satellite galaxy profiles), which is most likely 
the result of efficient tidal disruption of the subhaloes as 
they pass close to the cen tre. Recent semi- analytic mod- 
els of galaxy formation (e.g.lCroton et al. 2006; Bo wer et al] 
120061 : iDe Lucia fe B laizot 20q3), which are based on merger 
trees extracted from such dissipationless cosmological sim- 
ulations, by necessity have to adopt assumptions about the 
dynamical evolution of stellar component of the satellites 
galaxies which is not included in the dissipationless simula- 
tions. In general, the derived satellite galaxy densi ty profiles 
are c uspier than the subhalo distribution (e.g. I Sales et al] 
I2007T ) and in better apparent agreement with the observa- 
tions, although to our knowledge no detailed comparisons 
have been made between current semi-analytic models and 
observations. We address this point further below. 

A number of recent numerical studies have attempted 
to predict the radial distribution of satellites using self- 
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consistent cosmologica l hydro dynamic simulations (e.g . 
iNagai fe Kravtsovll 20051 : ISaro et alJ200eSl : iDolag et al.ll2009h . 
In general, the predicted satellite distributions are more 
cuspy than the distribution of subhaloes in dissipation- 
less simulations, owing to the increased resiliency (from a 
tightly bou nd stellar compon e nt) of the satellites to tidal 
disruption. INagai &; Kravtsovl (|2005l) compared t heir s imu- 
lations to the ob served profiles of I Car lb erg et al.l Jl997) and 
Lin et al. (2004) and concluded that the simulated profiles 
approximately match the shape of the observed profiles over 
the wide radial range of 0.1 < r2oo < 1.0. However, they 
did not compare the observed and predicted normalisation 
of the profiles and due to the heavy computational expense 
could only simulate a handful of systems and therefore were 
unable to explore any depencence on host halo mass. 

The main aim of the present study is to improve upon 
previous observational measurements of the satellite num- 
ber density profiles of clusters. To this end, we use the DR7 
release of the SDSS. Clusters are indentified by looking for 
overdensities in fields centered on Luminous Red Galaxies 
(LRGs, see Section 2). Our final sample (which is publicly 
availabl^fl) ? after all cuts, is 55 1 21, which is a factor of ^ 8 
larger than lHansen et al.l (|2005h . To normalise our radial 
profiles and to assign halo masses for stacking, we use an 
optical richness- X-ray temperature-total mass scaling rela- 
tionship, to help facilitate comparisons with models. As we 
will show, we find that the derived satellite profiles are rela- 
tively well-described by cuspy NFW distributions, as found 
previously, but with a somewhat lower concentration, which 
is roughly a factor of 2 lower than that of the dark matter 
distr ibution (pred i cted f rom simulations). Furthermore, un- 
like [HanserTeTlil] (2005), we find no significant variation in 
the mean concentration as a function of halo mass, which 
we mainly attribute to the different definitions of r2oo that 
have been adopted (see Section 5.2). We also explore the de- 
pendence of the satellite radial distribution on several other 
properties, namely the redshift of the cluster, the luminosity 
of the BCG, and the luminosities and colours of the satellites 
themselves. 

A secondary aim of the present paper is to make detailed 
comparisons of the predictions of current semi-analytic mod- 
els of galaxy formation to our observations. To that end, we 
use the public SQL database for the Durham and Munich 
semi-analytic models to derive the predicted satellite num- 
ber density profiles. We analyse the mock observations in an 
identical way to the data (e.g., statistically background sub- 
tract using random patches of the simulated sky and stack 
clusters in an identical manner). At intermediate/large radii, 
where the effects of tidal disruption are evidently small, we 
find that the current models reproduce our observed pro- 
files remarkably well in both shape and normalisation, the 
latter owing to the fact that the models have been tuned 
to match the global galaxy luminosity function. At smaller 
radii, the results are sensitive to the details of how the 
merger timescale for infalling satellites is calculated in the 
models. Our observations can therefore be used to inform 
the models on the efficiency of satellite merrging and dis- 
ruption. 



3 http:/ /www. ast.cam.ac.uk/ioa/research/cassowary/lrg_clusters 
/lrg_clusters_dr7.fits 



The present paper is organised as follows. In Section 2, 
we define the cluster sample and we describe how they are 
assigned an estimate of halo mass using a mass-richness X- 
ray calibration in Section 3. We also discuss the quality of 
the catalogue in terms of purity and completeness in Section 
3. In Sections 4 & 5, we present the construction of stacked 
number density profiles of the clusters in our sample, and 
investigate how the satellite concentration varies with halo 
mass and other observable quantities. Finally, in Section 6, 
we compare the observed satellite profiles to a series of semi- 
analytic models of galaxy formation. 

Throughout this work we assume a cosmology of Qm = 
0.27, Ov = 0.73, and h = 0.71. 



2 CLUSTER SAMPLE 

There are multiple strategies one can follow to create cat- 
alogues of clusters from the data provid e d by the SDSS. 
A quick look at Table 1 of lHao et aL I (|2010h reveals a 
growing family of methods utilizing deep SDSS multi-band 
photo metry to partition galaxies in groups around on e 
BCG (|Koester et all 20071 : IWen et al.ll2009l : lHao et alJ20ld ). 
These algorithms proceed by first locating candidate BCGs 
and then assembling a list of surrounding candidate satel- 
lites. This can be done either in pseudo-3D space com- 
posed of t wo angular coord inates and the photometric red- 
shift (e.g.. IWen et alj 12009) or by requiring that the angu- 
lar proximity is complemented by si milarity in colour (e.g., 
iKoester et aDl2007l : lHao et aljfcoich . The success of these 
methods rests on the powerful assumption of existence of a 
BCG near the bottom of the cluster's potential well. 

We search for clusters in fields centered on LRG galax- 
ies (i.e., each LRG is a potential BCG sitting at the cen- 
ter of a cluster). Not only does this simplify and speed up 
the identification of clusters, it improves the reliability of 
the search in the higher redshift range where we do not 
have to rely heavily on increasingly more uncertain photo- 
metric redshift. With this simple modification, we can now 
take advantage of the large number of LRGs available in 
the SDSS DR7. T o select the LRGs, we apply the standard 
phot ometric cuts ([Eisenstein et al.ll20Qll ; [Tbieiro &; Percivall 
l201ll ). which are designed to allow for a passively evolving 
stellar population. Specifically, we shall use the colour, mag- 
nitude an d surface brightness criteria set down in equations 
4-13 of lEisenstein et al.1 (|200ll ). 

We restrict our clusters to lie in the redshift 0.15 < z < 
0.4. The lower limit is in place to minimise photometric con - 
tamination of the LRG sample (|Toieiro fc Percival 120101 ) , 
and the upper redshift limit allows for an approximately vol- 
ume limited sample of LRGs and a complete sample of satel- 
lite galaxies down to a modest magnitude limit M r = —20.5 
for all galaxies with z ^ 0.4 (See Section [3] for details). 
Although DR7 LRGs are not perfectly sampled due to fibre 
collision effects, we do not expect our results to be affected a s 
their completeness is ^95 per cent (Eise nstein et al.ll200lh . 
We find - 85 000 LRGs in the above redshift range in SDSS 
DR7, which we further reduce by ~ 3, 000 by eliminating 
fields containing gaps or survey edges (i.e. with less than 85 
per cent area completeness per field). 

Our final cut is to ensure that there exists only one 
BCG per system. Recall that our approach is assume that 
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all LRGs are potential BCGs living at the centers of clus- 
ters. In reality, some clusters (particularly massive clusters) 
will contain more than a single LRG and by default these 
systems would be identified as separate clusters, which is 
obviously undesirable. To address this issue, we perform a 
LRG neighbour search within r2oo in angular separation 
and Azlrg ^ 0.02, from the current LRG and keep the 
brightest one as the BCG. Any other LRGs within the clus- 
ter aperture are kept as satellites. This removes a further 
~ 10 000 duplicate objects from the sample and leaves a 
sample of ^ 72 000 cluster candidates. Although findings by 
ISkibba et all (|2010h suggest that the central galaxy in the 
cluster may not always be the brightest, we investigate the 
effects of cluster mis-centering in Sect ion [4.2.21 and find that 
these effects are small and do not affect our results. We find 
that a large fraction of these 70 000 (~ 20 per cent) cluster 
candidates in fact contain no excess galaxy overdensity (or 
richness above the background) in the surrounding environ- 
ment (see Section [3721 for details). These 'field' systems are 
correspondingly removed from our sample, which yields a fi- 
nal base sample of 55 121 c lusters. Thi s num ber exceeds the 
latest sample published by Wen et al . (2009) by a factor of 
2 and is similar in size to the GMBCG catalogue (|Hao et al.l 
2010). Naturally, owing to the similarity of the search tech- 
niques, these cluster catalogs have a large number of objects 
in common. We provide the interested reader with a further 
comparison in the later sections. 

In what follows, when calculating the stacked radial 
satellite profiles of our clusters, we will assume the clus- 
ters' centers to lie at the BCG position. There is, of course, 
some doubt as to how big the offset between the true centre 
and the BCG could be. We investigate this uncertainty in 
the Section [4.2.21 Note that not only our approach is conve- 
nient, but it provides an accurate and well-defined stacking 
centroid centre that can be used in the future for stacking 
of galaxy agglomerations of any apparent richness. 

Throughout the analysis presented in this paper, we use 
magnitudes corrected for effects of dust extinction (Schlegel 
et al). BCG absolute magnitudes are calculated using K- 
corrections provided by th e calc ulation package pro vided by 
IChilingarian et all (|2010h . The IChilingarian et all polyno- 
mial fits provide no correction for passive evolution, and we 
do not make use of any additional evolution corrections in 
this work. The reason for this is three- fold. Firstly, in this 
work we calculate the absolute magnitude of both satellite 
galaxies and LRGs, and thus we want to avoid any additional 
uncertainty in the evolution corrections due to uncertain 
stellar populations. Secondly, an important part of this work 
is a comparison of the observations to various semi-analytic 
models, whose synthetic galaxy magnitudes also contain no 
evolution corrections. Finally, by applying evolution correc- 
tions to our central LRGs and redoing the analysis as a test, 
we find that our results are unchanged. 

A summary of the mean brightness and colour proper- 
ties of the LRG and satellite populations in our sample is 
shown in Table [T] As expected the central LRGs are signif- 
icantly brighter and redder than the surrounding satellite 
galaxy populations. 



Table 1. Summary of the LRG and satellite galaxy populations. 



Galaxies 


M r 




g-r 


a (g-r) 


LRG 


-23.3 


0.4 


0.94 


0.11 


Satellites 


-21.4 


0.2 


0.72 


0.05 



A characterisation of the mean and standard deviations of the 
brightness and colours of galaxies in our sample. All magnitudes 
are k-corrected to redshift zero, and colours are quoted in the 
rest-frame. The satellite measurements reflect the distribution of 
mean satellites properties within r2oo 7 and do not include the 
central LRG. 

3 HALO MASS ESTIMATES 

To assign total halo masses to objects in our catalogue, we 
use the correl ation between the cluste r's optical richness and 
its mass fe.g. lYee &; Ellingsonl 12003). This is a well-known 
strategy (e.g. |Voit|[2005|) , however, the devil is in the details: 
most importantly, the choice of the suitable richness mea- 
sure and the calibration of the mass-richness relationship. 
In this work, we choose to link cluster's richness to its mass 
via X-ray temperature. The scatter in the richness- X-ray 
temperature relationship is significantly smaller compared 
that of relationships between richness and X-ray luminos- 
ity or richness and velocity dispersion. Theoretically, X-ray 
temperature has also been shown to be a robust tracer of 
the underlying halo mass, less sensi tive to non-grav itational 
effects of energy redistribution fsee lVoit et al.l 12002). 

With this in mind, we assemble the "anchor" subset 
of SDSS clusters with published X-ray temperature mea- 
surements. These are used to calibrate the optical richness 
- X-ray temperature relationship. Finally, to connect rich- 
ness to halo mass , we employ the X-ray temperature-mass 
relationship from IVikhlinin et al.l ([2006) . One obvious dis- 
advantage of our method is that only a small fraction of our 
clusters have reported X-ray temperatures and therefore our 
calibration sample is but a small fraction of the entire sam- 
ple. However, our calibration sample covers the entire range 
of halo masses analysed in this paper and so we do not need 
to perform any extrapolation. Also, as we demonstrate be- 
low (see Section [6} , there is excellent consistency between 
our observed satellites profiles and those from semi-analytic 
models of galaxy formation for which we use the true halo 
mass. This agreement is non-trivial and implies that our halo 
mass estimates must be quite accurate on average. 



3.1 Mass- Temperature relation in the "anchor" 
cluster sample 

The X-ray cluster temperatures in our "anchor" sub- 
set are m ainly d rawn from th re e ca talogues: those 
(l200lh. iMaughan et ah| (|2008T ) and the AC- 



by IHc , . , y 

CEPT sample (Cavag nolo et al We also use three 



lower temperature sampl e s from iMulchaev et al" I EM) . 
lOsmond fc Ponmanl (|20o3 ) ISun eta l. (2009). There is some 
overlap between the different samples used, and in cases 
where a cluster is reported multiple times, we assign an 
average X-ray temperature to the object. In these dupli- 
cate cases, the scatter between different measurements of 
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the same object is typically within the mean la errors of the 
two measurements (^70 per cent within la and ^90 per cent 
within 2a). We find 129 unique X-ray clusters from the above 
samples within SDSS DR7, within a redshift range of 0.01- 
0.4. These catalogues furnish us with a tem perature, from 
which we calculate the mass according to IVikhlinin et al.l 
(|2006h . 

M 50 o M 5 / T \ a 

M© E(z) J ' [ } 

where rsoo is the radius which encloses the mean density that 
is 500 times the critical density of the Universe, M500 is the 
mass enclosed within rsoo, T is the temperature, E (z) is 
the correction for self-similar evolution and a = 1.58 ± 0.11 
and M 5 = (2.89 ± 0.15) x 10 14 h~ 1 M Q are determined by 
the sl ope of the mass-temperature relation. IVikhlinin et al.l 
<|2006h derived this relationship from 13 low-redshift clus- 
ters (with median mass of M500 ~ 5 x 1O 14 M0) under 
the assumption of hydrostatic equilibrium using spatially- 
resolved X-ray surface brightness and temperature profiles 
from Chandra. Na g ai et al. (2007) have tested the methods 
of IVikhlinin et all (|2006h on a set of clusters from cosmo- 
logical simulations and showed that the recovered masses 
are accurate to ~ 15% within rsoo- Recently, Sun et al. 
(2009) have shown that the m ass-temperature relation de- 
rived bv lVikhlinin et all (|2006h also extends down to lower- 
mass galaxy groups. This was shown using a large sam- 
ple of 43 groups observed with Chandra (median mass of 
« 8 x 1O 13 M ). 

3.2 Optical properties 

In the absence of any prior information about the virial ra- 
dius of the cluster, the strongest correlation between the 
halo mass and cluster's environmental properties seems to 
arise when the galaxies are counted in 1-2 Mpc aperture, as 
indicated by the analysis of the most recent N-body simu- 
lations (|Haas et al.ll201lh . To test the applicability of this 
measure to the SDSS DR7 data, we investigated the cor- 
relation between X-ray temperature (a proxy for the to- 
tal mass) and observables such as richness and luminosity. 
The SDSS photometric and astrometric data for each clus- 
ter field are obtained by running th e q3c radial query al- 
gorithm ([Koposov &; Bartunov| [2006) on a locally available 
SDSS DR7 database. This retrieves all galaxies within the 
five Mpc which satisfy the following magnitude and redshift 
conditions: 

r<21.5, \z-zbcg\ ^0.04(1 + zbcg), a z < 0.2, (2) 

where r is the extinction-corrected r-band model magnitude, 
z is the photometric redshift of neighboring galaxies and a z 
is the corresponding photometric redshift error. To ensure 
clean photometry we also exclude galaxies which satisfy the 
SATURATED photometric flag. The apparent magnitude cut 
was selected to be r = 21.5, as at this limit the SDSS com- 
pleteness is close to 100 per cent and the accuracy of star 
galaxy separation is >90 per cent. This limit was verified 
from r epeated SDSS Stripe 82 imaging. Following I Wen et al.l 
(2009), we use the variable photometric redshift gap, which 
ensures most cluster member galaxies are included regard- 
less of cluster redshift. We first calculate the total number of 
galaxies with K-corrected magnitudes ^ —20.5 in the fixed 



1 Mpc aperture placed on the cluster's centre (i.e., the BCG 
position) . To calculate the absolute magnitudes of the mem- 
ber galaxies, we assume that they all have redshifts equal to 
zbcg • Note that this absolute magnitude limit is chosen as 
it corresponds to the faintest galaxy observable (i.e. with 
apparent magnitude r < 21.5) at redshift 0.4, and ensures 
completeness at lower redshifts. We then gauge the char- 
acteristic background contribution by calculating the mean 
background density of galaxies (also with M r ^ —20.5 at the 
redshift of the BCG) in a series of annuli spanning a distance 
range of 2.5-5 Mpc from the cluster centre. As a check of this 
background density (in an annulus), we also compared with 
the galaxy density in randomly selected apertures, and the 
average difference between the density estimates was found 
to be independent of halo masfl The final object richness 
iViMpc is obtained by subtracting the estimated background 
contamination from the total count in the 1 Mpc cluster re- 
gion. The error estimate on the richness is derived from the 
poisson errors on the cluster aperture and the background 
combined in quadrature. The correlation between M500 and 
A/iMpc out to a redshift of 0.4 (and a magnitude limit of 
-20.5 in r) for our "anchor" sample is shown in Fig. [1] 

We also investigated correlations between M500 and 
other optical observables such as cluster luminosity and rich- 
ness within a scaled aperture (designed to maximise the sig- 
nal to noise of the cluster signal above the background). 
However, we found that either the scatter was increased 
in the case of luminosity, or, for the scaled aperture, the 
improvement was too insignificant to justify the additional 
complexity. In short, the simple 1 Mpc aperture proved suf- 
ficient to provide an adequate probe of halo mass with fairly 
low scatter over a reasonable range of cluster masses. 

3.3 Power-law models for mass-richness relation 

We chose to parameterise the correlation between mass and 
richness as follows 

log (^) = m]og Nim pc +b, (3) 

where slope m and interce pt b are det e rmine d using the ro- 
bust fitting algorithm of IHogg et "al ] (|2010h . The method 
reduces the impact of outliers, takes into account horizontal 
and vertical error bars, and also m odells the in t rinsic scat- 
ter of the correlation. According to IHogg et al.l (|2010h . the 
likelihood function which describes the linear model is given 
by 

N 1 N A 2 

W^-E^^ + VJ-E^ (4) 

where K is a constant, A* is the orthogonal displacement 
of each data point from the line (Appendix [A| , E» is each 
data points' orthogonal variance (Appendix |A|, and V is 
the intrinsic variance. This likelihood function is maximised 
to give the best fit model for the slope m, intercept b and 

4 The exact determination of background density is not impor- 
tant (provided we are internally consistent), as we are using this 
richness estimate purely to calibrate the halo masses. We use 
randomly placed apertures for estimating the background in our 
analysis of satellite number density profiles in Section 4. 
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V . The outliers are pruned by rejecting the lowest 5 per 
cent of contributing points to the likelihood function on each 
likelihood evaluation. The fit is found to be not too sensitive 
to the exact fraction of rejected points. For the "anchor" 
sample of clusters, the best-fit values are m = 1.4 ± 0.1, 
b = 12.3 ± 0.1 and V = 0.007 ± 0.002. 



3.4 Cluster masses 

We can use the scaling relation in Equation [3] to obtain 
M 50 o estimates for all optically identified clusters which do 
not have direct X-ray measurements. The distribution of the 
masses for all objects in our BCG-centred sample are shown 
in the right panel of Figure [1] 

In what follows, we adopt a lower mass limit of 
10 13 " 7 M© as this is where we have only few clusters for 
calibration and the scatter in our mass calibration becomes 
large (see left panel of Figure [TJ). At this limit, the sample in- 
completeness becomes evident as it coincides with the drop 
in the mass function of the sample as shown in the Right 
panel of Figure [1] Although this mass cut culls a large frac- 
tion of our catalogue, we are still left with a substantial 
sample of 20 157 clusters. 



3.5 The quality of the cluster catalogue 

Here we aim to check whether any significant impurity 
and/or incompletness could affect the conclusions drawn re- 
garding the radial distribution of satellites as a function of 
mass. 



3.5.1 Purity 

Purity is a measure of the degree of contamination in a sam- 
ple, i.e., in our case, the number of clusters that are real 
compared to spurious detections. In principle, these false 
detections which are caused by interlopers in photometric 
redshift space should be minimised by our imposition of a 
photometric redshift cut in Equation [2] 

We can assess the degree to which our sample suffers 
from impurity by calculating 'cluster' richnesses and implied 
masses in apertures placed randomly on the SDSS footprint. 
Although there exists a small chance that we will pick up real 
galaxy clusters in our randomly placed apertures, the prob- 
ability of this is low, and the averaged recovered richness 
should be distributed about zero. Therefore, the fraction of 
such 'objects' with the mass in excess of M 500 > 10 13 - 7 M© 
yields the upper limit to the impurity in our sample. 

The richness distribution of 1 000 randomly positioned 
apertures is plotted in left panel of Figure [2j along with the 
implied richness limit of 10.3. The resulting impurity frac- 
tion is rsj 3 per cent. While a small fraction of these impuri- 
ties will correspond to 'real' clusters, this simple calculation 
allows us to place a lower limit on the sample purity for clus- 
ters with M500 > 10 13 - 7 M© of - 97 per cent. Reassuringly, 
we find that this fraction does not change significantly with 
redshift. 



3.5.2 Completeness 

The incompleteness of a sample corresponds to the number 
of real clusters that are missing, i.e. in our case the number 
of true galaxy clusters with M500 > 10 13-7 M© which are 
lost below our imposed mass limit due to Poisson errors. 

To assess the degree of incompleteness in our sample we 
generate mock catalogues and recover their properties using 
the techniques described in Sections 13.21 and 13.31 A galaxy 
cluster field is simulated by sam pling from the projected 
NFW profile (Ba rtelmannl I1996T ) with a given input halo 
mass (M500), a nd a concentration ^] determined according to 
the relation in iDuffv et al.l (|2008h . The total normalisation 
of the NFW number density in a simulated cluster is given 
by: 



A/tot = iVi Mp. 



Jo rvir / 0) r dr 
Jo f 0) r dr 



(5) 



where Ni m P c is given by the mass-richness relation 
(Equation |3L f(r ) is the projected NFW density profile 
( Bar telmannl ll996) . and r is the radius from the cluster cen- 
tre in Mpc. To account for Poisson- like scatter at low cluster 
occupations, the number of samples generated for the given 
simulated object is Ntot, which is drawn from a Poisson dis- 
tribution with A =Ntot- We then include a flat background 
galaxy distribution over the 5 Mpc aperture. We find that 
the background density of bright galaxies (M r < —20.5) 
does not vary significantly with redshift in SDSS DR7, and 
so the background density is drawn randomly from a Gaus- 
sian with mean fib = 3.38, and standard deviation ab — 0.79, 
in units of galaxies per Mpc 2 . 

A mock 5 Mpc cluster field is then analysed according 
to the prescriptions in Sections 13.21 & 13.31 and the implied 
halo mass M500 is obtained using mass-richness relation. 
The process is repeated 1000 times to generate a distribu- 
tion of M500 values for a given M500 value. This output 
distribution resembles a Gaussian centered on M^q . The 
fraction of M500 values which fall below the mass limit cor- 
responds to the incompleteness of the sample at that M^qq. 
The completeness /complete as a function of M500 is shown 
in the right panel of Figure [2] As expected, /complete falls to 
~ 50 per cent at the mass limit, and the completeness in the 
lowest mass bin (10 13 7 < M 50 o < 10 14 M©) is - 70 per 
cent. Reassuringly, as M500 is increased the curve rapidly 
approaches 100 per cent completeness. 

A later result of this paper states that satellites in clus- 
ters are approximately half as concentrated as the dark mat- 
ter. It is therefore important to ascertain whether the com- 
pleteness of the sample is sensitive to the concentrations of 
the simulated haloes to prevent a circular argument. In our 
simulations, we find that the selection function (right panel 
of Figure [2} is insensitive to the concentration adopted in 
the range 2 < c < 10. This is because the aperture used for 
the richness calculation (see Section \3. 2 [) is sufficiently large 



5 Our work (Section 15.20 , and the work of others suggest that 
satellites are found to be a factor of ~ 2 less concentrated com- 
pared to the parent dark matter haloes, and therefore we adopt 
a concentration of Cd m /2 for the simulated satellites. 

6 This helps explain why the sample purity does not change sig- 
nificantly with redshift. 
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Figure 1. Left: Mass within rsoo for 129 "anchor" clusters out to redshift 0.4 as a function of optical richness iViMpc- The richness 
estimate is complete to the magnitude limit of M r ^ —20.5. The solid line shows the best-fitting power-law which is used to evaluate 
^500 fo r other clusters in our catalogue. The horizontal dashed line represents the minimum mass adopted for the sample, and the 
dotted lines (representing ±y/V) indicate the modelled intrinsic scatter in the relationship. Right: The distribution of masses in the 
sample obtained via the mass-richness relation shown in the Left panel. The sample suffers from incompleteness at the low mass end. 
The vertical dotted line shows the minimum mass adopted for the profile stacking analysis. 



to ensure that satellites in low concentration haloes are still 
predominantly located inside 1 Mpc from the cluster centre. 

It is empirically known that not all massive galaxies are 
red. In particular, BCGs sitting at the centers of massive 
clusters with very short central cooling times often show 
signs of active star formation and young stella r populations 
(e.g.JCrawford et al.| [l999: Bildfell et al. 2008; R awle et al.1 
2012). In these cases, the true (blue) BCG will not be des- 
ignated as a BCG according to our method. But note that 
as long as the cluster has at least one LRG it will be part 
of catalog (subject to it being overdense with respect to 
the background and above the adopted mass cut), even if 
the LRG is not the true BCG of the cluster. To our knowl- 
edge, there is no evidence for massive groups and clusters 
(i.e. with at least 10 members with r < —20.5) with an en- 
tirely blue galaxy population (i.e., no LRGs). Therefore we 
do not expect that this 'blue BCG' effect has implications 
for our completeness calculation. Furthermore, so long as 
our mass-richness relation is valid, we should be picking out 
the same types of groups and clusters in the observations 
and the models we compare to in Section 6 (even though 
the selection process is not identical in a procedural sense). 

In the cases where the BCG is blue, the centering from 
our procedure (which would pick the brightest LRG) would 
be inaccurate. But as we show in Section 4.2.2, the effects of 
inaccurate centering are minor in general and do not affect 
our main results or conclusions. We can estimate the pro- 
portion of 'blue-BCG' clusters which may be mis-centered 
due to this effect by looking at the results of lCrawford et all 
(1999), who look at the amount of star-formation in BCGs 
in a modest sample of 216 ROSAT clusters. We find 149 of 
these clusters within SDSS, and of these, only 39 (i.e. 26 per 
cent) have detectable H-a emission. We then cross match 



with the LRG sample and find an overlap of 20 with both an 
LRG and H-a emission. Therefore, by this logic we are mis- 
centering ~50 per cent of clusters with strong H-a emission 
in the BCG, which translates to only ^13 per cent of all 
clusters. 



3.5.3 Comparison to other cluster catalogues 

We can compare our cluster sample statistics with those 
of other published catalogues, e.g . the MaxBCG sample 
(|Koester et afll2007b. catalog u es bvlWen et al.1 (|2009h (here- 
after WHL) and lSzabo et al. (| 20 111 ), and the recent updat e 
to the MaxBCG - the GMBCG sample (|Hao et aljl20ich . 
These use slightly different methods of cluster identification 
ranging from searching for a red-sequence in photometric 
data (MaxBCG), to the use of a friends-of- friends algorithm 
(WHL) . The purity and completeness above the given mass 
limit for these published cluster catalogues are provided in 
Table [2] 

In general, the previously published catalogues yield 
both purity and completeness of ~ 90 per cent for clusters 
with M500 > 10 14 Mq. Our catalogue with the purity of 
> 97 per cent and completeness of > 95 per cent for haloes 
with M500 > 10 14 ' 2 M©, is comparable in quality to the 
previously published work. 

As a final test, we repeated one of the later results in 
our paper (stacking the satellite radial profiles as a func- 
tion of halo mass; Section [5.1[) , using the MaxBCG cluster 
sample. The result is included in Appendix[Bj and is entirely 
consistent with the result determined using our sample. The 
consistency between the MaxBCG catalogue and our sam- 
ple provides another confirmation that it is the physics of 
cluster formation, and not the selection, purity or incom- 
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Table 2. Purity and completeness in other cluster samples. 



Survey 


M^g/M© 


Purity 


Completeness 


This work 


~7.5el3 


>0.97 


-0.90 


MaxBCG 


lel4 


0.9 


0.85 


GMBCG 


~lel4 


~0.9 


-0.95 


Szabo et al. 


lel4 


0.9 


0.85 


WHL 


2el4 


0.95 


0.9 



pleteness in optical surveys which determines the observed 
radial profiles of galaxies in clusters as a function of halo 
mass. 



4 SATELLITES IN CLUSTERS 

4.1 Construction of the number density profiles 

For each cluster in the catalogue, galaxies satisfying the con- 
straints in Equation [2] are extracted within five Mpc of the 
BCG centre. Then from a random ty • 5 2 Mpc 2 patch on 
the sky, galaxies at the redshift of the cluster are selected 
to represent to the background distribution. We calculate 
the physical radial distances of all neighbour galaxies with 
respect to the BCG position and construct their radial dis- 
tributions. This process is repeated for all clusters in a given 
sub-set (e.g., halo mass bin) to yield a high signal-to- noise 
number density profile. The mean background level is ob- 
tained by co-adding random background fields to give the 
average background density, which is then subtracted from 
each radial bin. Finally, the stacked profiles are divided by 
the total number of clusters which have contributed to the 
stack to give the mean number density profile. We provide 
an estimate of the error on the mean due to Poisson scatter. 
The true error bar will not be just Poisson distributed due 
to the fact that there is scatter in the shapes and normali- 
sations of satellite profiles within the stack. This scatter is 
taken into account in the full modelling procedure described 
in Appendix[Cl and the characteristic error bar as a function 
of radius in the satellite profiles is calculated accordingly. 



4.2 Selection effects 

4.2.1 Bright galaxy obscuration 

As shown below the stacked satellite density profiles gen- 
erally show a flattening in the centre, at distances 20-30 
kpc from the BCG (e.g. Figure [5}. This flattening may 
have a physical explanation but it could also simply be due 
to the faint satellite galaxies being swamped by the light 
of the bright central BCG in the SDSS imaging data. We 
therefore test the ability of the SDSS photometric pipeline 
(|Lupton et alJlipOll ) to resolve the galaxies in the wings of 
the central BCG by comparing object counts in the fields 
that have been observed with both SDSS and Hubble Space 
Telescope (HST). The SLACS (Slo an Lens ACS Surv ey) 
galaxy-galaxy strong lensing survey ([Bolton et al.ll2008l ) is 



an ideal sample for such comparison. We use a sub-setQ of 38 
SLACS lenses with large BCGs with redshifts 0.1 < z < 0.3. 

We obtain the ACS images from the MAST online 
archival and measure radial positions, galaxy /stellar type 
and l uminosities of pos sible satellites using SExtractor soft- 
ware (jBertin fe Arnoutdfl996h . We calibrate the magnitudes 
returned by SExtractor to the SDSS magnitudes by cross- 
matching positions of galaxies in the HST images and the 
SDSS database, and perform the apparent magnitude cut of 
r = 20.35. This cut is the apparent magnitude correspond- 
ing to the absolute magnitude limit of —20.5, at the mean 
redshift of our BCG cluster sample ((zbcg) = 0.29). As well 
as the loss of satellites due to the central BCG obscuration, 
we also assess the potential miss-classification of stars as 
galaxies by the SDSS pipeline in the following analysis. 

The efficiency of detecting BCG satellites in the SDSS 
compared to the HST as a function of radius is given by: 



fdet — 



Ci,hst; £i jS dss &ti,hst — galaxy) 



(ti.hst = galaxy) 



(6) 



where d, x corresponds to the central position of the satellite 
and U jX corresponds to the star/galaxy type as classified in 
the respective survey x. We allow for the centering error of 
1" when matching satellites between the two surveys. The 
fraction /det as well as the numerator and the denominator of 
Equation [6] are plotted in Figure [3] It is clear that the SDSS 
detection efficiency rapidly drops below 90 per cent when 
the separation from the BCG decreases below ~ 5". This 
number is used to define the boundaries of the region around 
the BCG which is affected strongest by incompleteness. 

We have also investigated whether or not the second 
brightest galaxy could obscure fainter satellite galaxies and 
produce non-physical features in the radial profiles. This 
scenario is tested by splitting the catalogue into faint and 
bright samples based on the luminosity of the second bright- 
est galaxy (SBG) within 200 kpc of the BCG. The obscu- 
ration effect, if present, would show an enhanced decrement 
in the radial profile of the clusters with the brighter SBG. 
Such a decrement is not detected and therefore we conclude 
that the SBG obscuration is not significant. 

Another possible selection effect that we tested for is 
the degree of galaxy-galaxy overcrowding and hence obscu- 
ration in dense clusters as a function of redshift. The radial 
profiles of the clusters at high redshifts could be systemati- 
cally affected in the inner regions as the constituent galax- 
ies appear closer together in angular space, and, hence, are 
more difficult to resolve with a modest size PSF. We in- 
vestigated this effect by calculating the covering fraction 
(fraction of physical area covered by galaxy light profiles) 
within 200 kpc of the BCG, for the highest mass BCG clus- 
ter^ M500 > 10 14 - 5 M Q . The galaxy light is assumed to be 
entirely contained within twice the petrosian radius found 
in the SDSS database. We found that the covering fraction 



7 The entire SLACS sample consists of over 100 galaxy-galaxy 
lenses, but we restrict our sample to a small sub-sample observed 
with one orbit of ACS-WFC F814W imaging. 

8 http://archive.stsci.edu/hst/, cycle 15, proposal 10886. 

9 The over-crowding of galaxies is most severe when there is dense 
clustering in the highest mass objects and so this provides an 
upper limit on the effect. 
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Figure 2. Left: Richness iViMpc of 1000 fields placed randomly in the SDSS footprint. The fraction of objects above the richness limit 
marked by the vertical line approximates the sample's purity. Right: Completeness fraction as a function of halo mass for the cluster 
sample. The vertical line shows the imposed mass limit of 10 13-7 Mq corresponding to the richness cut-off in the Left panel. 




Figure 3. Left: Detected number of satellite galaxies in SDSS (solid) and HST (dashed) for the SLACS sample imaged with the ACS. 
Right: Efficiency of detecting satellites in SDSS as a function of distance from the BCG. 



never exceeds the level of 0.01 and therefore is not a signif- 
icant source of bias on our results. 



4-2.2 BCG - cluster mis- centering 

There has been considerable debate regarding the choice 
of the cluster centre when computing the satellite pro- 
file, especially the effect of possible ar tifacts caused by the 
choice of BCG as the cluster's centre (ICarlberg et aL l ll997l : 
lHansen et all 1 20051 : Ivan den Bosch et al.ll2005h ! 

Although there are clear reasons for stacking on the 



BCG centre, we have investigated the possible biases by 
comparing the profiles stacked on the BCG with the pro- 
files stacked around the fitted centre of the satellite galaxy 
distribution. A tentative cluster centroid is found by fitting a 
simple overdensity model consisting of constant background 
and an over-density with exponential radial profile, to the 
satellite galaxy field (see Budzynski et al., in preparation 
for details). To ensure that the 2D density fit is reliable, 
we restrict our test cluster sample to contain only signifi- 
cant over-densities above the background within 300 kpc of 
the BCG, which corresponds to >4-sigma Poisson fluctua- 
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Figure 4. Mean satellite number density profiles as a function of 
radius for ~ 9 700 clusters centred on the BCG (green) and fitted 
centroid to the galaxy distribution (red). The black points show 
the representative scatter between profiles within each stack, and 
the shaded grey region shows the mean error bar due to Poisson 
scatter as a function of radius. The profiles are very similar be- 
yond ~ 0.06 r5oo, which corresponds to the region affected by the 
BCG obscuration. 



Figure 5. Mean satellite number density profiles for subsets of 
clusters in bins of halo mass. The black points show the represen- 
tative scatter between profiles within each stack, and the shaded 
grey region shows the mean error bar due to Poisson scatter as a 
function of radius. It is clear that the higher mass objects have a 
larger number of satellites compared to those in lower mass bins. 
The shaded (dotted) region represents the area of incompleteness 
in the number density profiles due to obscuration by the BCG. 



tions. The distribution of offsets between the BCG and the 
fit centre for ~ 9 700 significant clusters with is found to 
be a Gaussian centred around zero with the standard devia- 
tion of 100 kpc. Importantly, the offset is not found to vary 
significantly with halo mass or redshift. Figure [4] shows the 
comparison between the cluster profiles centred on the BCG 
and those centred on the fitted centroid. 

Encouragingly, the profiles differ only slightly beyond 
the BCG obscuration threshold of ~ 0.06 r 500, which indi- 
cates that the miscentering of BCGs will not strongly affect 
the conclusions drawn about the radial profile of galaxies 
beyond this radius. 



5 RADIAL PROFILE RESULTS 
5.1 Mass bins 

In order to study the mass-dependence of the satellite pro- 
files we have split the sample into four bins ranging from 
10 13 - 7 to 1O 15 O M . The radial profiles (Figure are ob- 
tained according to the method described in Section WA\ In 
the Figure, the shaded (dotted) region corresponds to the 
region in physical space strongly affected by the BCG ob- 
scuration effects modelled in Section [4~2l As expected, the 
radial profile of the higher mass bins have a larger normalisa- 
tion than the lower mass bins. This is due to the underlying 
mass-richness relation (Equation [3]). 

The choice of the four mass bins is motivated by the 
degree of scatter in the mass-richness relation (Figure [T]). 
Ideally, one would like to divide the sample into multiple 



mass bins. However, the situation is made more complicated 
by the fact that the mass function of haloes is not uniform 
(right panel Figure [1]), and also that there is potentially 
significant scatter of halo masses between bins. The choice 
of bin size is therefore a trade-off between needing enough 
bins to probe the physics of cluster formation which varies as 
a function of mass, and having bins wide enough to not have 
significant mass overlap due to mass measurement errors. 

We can model the degree of scatter between the mass 
bins by making use of the mock cluster simulations described 
in Section [3.5.21 The mock clusters are analysed, and mass 
estimates are obtained according to the prescription in Sec- 
tions [32] & [32] For a given input cluster mass, the recovered 
mass distribution from the simulation resembles a Gaussian 
about the input value, with a small degree of assymmetry 
owing to the nature of the Poisson process. To measure the 
effect of scatter between bins, we first model the scatter at 
the centre of a mass bin and convolve the recovered distri- 
bution with a square kernel with width equal to the size of 
the bin. This convolution widens the initial Gaussian, which 
represents the fact that the true cluster mass distribution is 
not entirely located at the centre of the bin. A further con- 
volution is then required to account for shape of the cluster 
mass function (see right panel Figure [1]), which is not uni- 
form across each mass bin. The extra convolution causes 
the centre of the widened Gaussian to be offset to the left of 
the bin. This effect is more pronounced in the higher mass 
bins as the mass function is steeper. The resulting bin-to- 
bin scatter is shown in Figure [6] The lowest mass bin is 
strongly affected by the inclusion of real groups and clusters 
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Figure 6. Distributions of recovered masses for simulated clusters 
in four mass bins. The scatter becomes larger as we approach the 
mass limit of our sample (10 13 7 Mq). The distributions in the 
higher mass bins are slightly offset to the left of the bin centre due 
to the steepness of the overall halo mass function at high mass 
end (see right panel of Figure [T}. 

with true masses below the mass limit. This effect corre- 
sponds to 50 per cent contamination for the lowest mass 
bin, 5 per cent for the second lowest bin and is negligible 
for the higher mass bins. Despite the presence of significant 
scatter between mass bins (particularly at the lowest halo 
masses), distinct mass distributions within these four bins 
can be clearly seen. We can therefore be reasonably confi- 
dent that any changes in shape seen in the radial profiles as 
a function of halo mass are real. 

It has been known for some time that the dark mat- 
ter mass density profiles look self-similar, i.e. no strong 
change in shape o f the profile as a function of mass, (e.g. 
iNavarro et al.Nl997l ): if the radial coordinate in the profile 
is scaled by a characteristic radius, e.g., the virial radius, 
the resulting density distributions for systems with different 
total mass appear nearly identical^! It is interesting to see 
whether or not such self-similarity also holds for the satellite 
galaxy radial profiles. 

We produce scaled radial satellite profiles by dividing 
both the distance and the number density by rsoo (the latter 
is required since £ p is a surface number density therefore 
we must take into account that more massive objects have 
larger line-of-sight lengths in physical units). The stacked 
profiles are normalized by the mean value of rsoo for the four 
mass bins we consider. The mean value of r^oo is calculated 
as 

10 They are not exactly identical because lower mass systems 
are slightly more concentrated than higher mass systems, which 
reflects the fact that lower mass systems collapsed earlier on av- 
erage at a time when the background density of the Universe was 
higher. 
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Figure 8. Concentration as a function of mass for the observed 
satellite number density profiles (triangles) and the Millenium 
dark matter clusters (squares) shown in Figure The solid line 
corresponds to t he predicted mass-concentration relations from 
iGao et al l d2008h . 

(rsoo) = ( 2000^) (Msoo) ) ' ' (7) 

where p c (z) is the critical density of the universe at the 
redshift of the cluster, and (M500) is the mean mass in a 
given mass bin. 

Figure [3 shows these scaled satellite profiles for the four 
mass bins considered. Remarkably, this scaling procedure 
removes most of the mass dependence in the satellite density 
distributions. Only a small residual difference remains in 
that the radial profiles of lower mass haloes appear to be 
slightly more peaked compared to those of the high mass 
clusters. 

5.2 Satellite concentration 

How well do satellites trace the underlying dark matter dis- 
tribution? We can test whether the satellite behaviour is 
matched by the overall dark matter halo density distribution 
by looking at the projected dark matter density p rofiles from 
the Millenium Simulation ( Springel et al.ll2005r ). We select 
haloes in the same mass bins as the observations and create 
density profiles as a function of r/rsoo- The DM simula- 
tion profiles are arbitrarily normalised by multiplying them 
by a constant factor so that the profile corresponding to the 
[14.0-14.4] halo mass bin passes through the observed galaxy 
profile at 3r5oo (right panel of Fig. [TJ) . As expected, the dark 
matter profiles are nearly self-similar. Interestingly, the dark 
matter profiles exhibit the same shape dependence as a func- 
tion of mass as the satellite profiles (i.e. slightly more peaked 
at lower mass) . This dark matter shape change is well docu- 
mented in terms of the mas s-concentrat i on relationship in N- 
body simulations (Nav arro et aPll997l : iBullock et afl koOl: 
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Figure 7. Comparison of the observed scaled profiles with the simulated ones. Left: Mean satellite number density profiles split into bins 
of halo mass. Both axes have been scaled by the mean rsoo in a given bin to remove the mass dependence. The black points show the 
representative scatter between profiles within each stack, and the shaded grey region shows the mean error bar due to Poisson scatter as 
a function of radius. The shaded (dotted) region represents the area of incompleteness in the number density profiles due to obscuration 
by the BCG. Right: Observed satellite number density profiles (solid black dots) in a single mass bin (from 10 13-7 — 10 15 Mq) compared 
to the dark matter profiles of the Millenium clusters in the mass bins shown in the left panel (colour curves). The normalisation of the 
dark matter profiles have been multiplied by an arbitrary (constant) factor. 



iDuffv et al.ll2008l : lGao et al.ll2008h . in which the lower mass 
haloes appear more concentrated compared to the high mass 
ones. This concentration dependence is thought to be due 
to the fact that more massive haloes collapse at a later cos- 
mic time, where the background density of the universe is 
lower. It is not immediately obvious, though, that the satel- 
lites should preserve this trend, since presumably a large 
fraction of the satellites we see in orbit today were accreted 
not that long ago. The fact that the overall shapes of the 
satellite profiles are systematically different from the dark 
matter but that both are individually nearly self-similar is 
therefore quite intriguing. 

We can quantify the difference in the shapes of the dark 
matter and the satellite density distributions by measur- 
ing their concentrations as described by the NFW profile 
(Figu re El). We fit the projected NFW profile (|Bartelmannl 
1996) to the satellite density profiles over the radial region 
0.07 < r/r 500 < 1.5. Figure [8] shows the obvious drop in halo 
concentrati on as a fun c tion o f mass. The best-fit power-law 
relation of iGao et al.1 (|2008T ). which was derived from fit- 
ting over a much wider range of halo masses, is over-plotted 
for comparison^!. One of the most striking observations is 
that the concentration of satellites is roughly a factor of two 



11 The small discrepancy between our derived mass- 
concentration relation for th e simulate d clus ters and the 
best-fit power-law relation of IGao et al] (2008) may be the 
result of those authors fitting over a wider range of halo masses. 
Another difference is that our concentrations are derived by 
fitting to the projected surface mass density profiles, rather than 
to 3D density profiles. 



smaller than the dark matter across the full range of halo 
masses we have explored. 

This systematic difference between the projected DM 
and galaxy radial distributions is generally consis tent with 
the f indings of previous studies. For example, I Lin et all 
(|2004h in their study of 2MASS clusters in the K-band, find 
that the distribution of satellites has the mean concentra- 
tion of 2.9. Additionally, tw o sepa rate an alyses of the CNOC 
cluste rs by ICarlberg et al] (|1997T ) and | van der Marel et al. 
(2000) find the concentrations of 3.7 and 4.2 respectively. Al- 
though our concentrations are roughly a factor of 1.5 lower 
than these studies, we find a general qualitative agreement 
in that we see a relatively shallow dependence of the con- 
centration on the halo mass. This shallo w dependence is ap - 
pare ntly discord a nt wit h the findings of lHansen et al] ((2005) 
and IChen et al] ([2006), who find that the concentration is 
a very strong function of optical richness. It is likely that 
this difference can be attributed to the differing character- 
istic radii used to scale the radial coordinat es of the pro- 
files (s ee the discussion in Section 1). Namelv jHansen et al] 
(2005|) define their characteristic radii (R200) with respect 
to the background density of galaxies, whereas our charac- 
teristic radii are denned with respect to the critical den- 
sity of the Universe from X-ray measurements (see Equa- 
tion [7]). Indeed, using clusters in common between our cat- 
alog and that of the MaxBCG catalog, we find a scaling 
of r2oo ~ 0.65r5oo ~ 1.4R|q , which accounts for the steep 
richness dependence of their derived concentrations. 

Finally, a possible explanation for the offset between 
dark matter and satellite concentration is the fact that the 
galactic subhaloes are strongly affected by tidal evolution 
and merging process within the inner regions of the cluster 
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(jNagai fc Kravtsovl 120051 : IChen et alJl2006j ). We will inves- 
tigate the physics of subhalo disruption and merging in the 
context of semi-analytic galaxy formation models in Section 

El 



5.3 Satellite profile shape as a function of other 
properties 

Since the radial distribution of satellite galaxies does not 
vary strongly with halo mass, we can combine all galaxies 
into one cluster mass bin, ranging from 10 13-7 to 1O 15 O M0, 
and investigate the dependence of the profile (particularly 
its shape) on other properties. We now investigate whether 
the stacked radial profiles of satellite galaxies depend on 
redshift, BCG luminosity (or BCG luminosity fraction), and 
the luminosity and colour of the satellites themselves. 



5.3.1 Redshift 

Splitting the sample according to cluster redshift reveals a 
small change in the shape of the derived radial distribution 
(Fig. [9] (a)). In particular, within r ~ 0.2r2oo, the profile 
is slightly shallower in our lowest redshift bin with respect 
to that of the intermediate and high redshift bins. Beyond 
this radius, the shape and normalisation of the profiles in all 
three redshift bins are consistent with no evolution. 

For reference, cosmological N-body simulations predict 
that the concentration should increase w ith decreasing red - 
shift for a cluster of fixed mass (see, e.g. JPuffv et ahlfeoOSh . 
However, over the relatively narrow range of redshift that we 
probe here, the DM concentration for a cluster of fixed mas s 
would only vary by « 7.5% according to lDuffv et al.l (|20Q8h . 
In any case, it is not immediately obvious that the shape of 
the galaxy and DM radial distributions should evolve by the 
same amount or even in the same sense. Indeed, dynamical 
friction and tidal stripping/disruption could plausibly act 
to reduce the concentration of the satellite distribution (for 
clusters of fixed mass) over cosmic time. 



5.3.2 BCG luminosity 

Splitting the sample according to relatively faint/bright 
BCGs (about the median BCG magnitude of M r — —23.3), 
reveals a small (but statistically significant within the Pois- 
son error bars) difference between the two samples (panel 
(b) of Figure [9}. Naively speaking, we might expect to see 
a central decrement in the satellite profiles for clusters with 
brighter than average BCGs (assuming the BCG is built 
from cannabalizing satellites). However, the observed trend 
actually goes the other way. A more mundane explanation 
for the trend is that there is a known dependence (albeit a 
relatively weak one) on the luminosity of the BCG and total 
cluster mass. When we split the sample based on BCG frac- 
tion (ratio of BCG luminosity to the background-subtracted 
cluster luminosity within 1 Mpc), the shape dependence of 
the satellite profiles all but disappears. We therefore find 
that when the halo mass dependence of the BCG luminosity 
is taken out, there is no obvious dependence of the satellite 
radial distribution on BCG luminosity. 



5.3.3 Satellite luminosities 

We turn our attention now to the brightness of the satellites 
themselves and investigate the dependence of the shape of 
the profile in four satellite luminosity bins over the range 
-20.5 > M r ^ -24.0 (panel (c) of Figure [9]). We find that 
the concentration of satellites falls slightly as their bright- 
ness in creas es. This is opposite to the finding of iLin &; Mohrl 
(|2004h and IChen et al.l (|2006h who find that the fainter 
satellites show a small dip in the central regions relative 
to the bright satellites. However, these studies only find a 
very small difference between faint /bright satellites and also 
stack a far fewer number of clusters than the number we 
present in this study. 

Our results imply that the dwarf-to-giant ratio (DGR - 
the ratio of faint-to-bright galaxies) , increases into the centre 
of the halo. This conclusion is in qualitat ive agreement with 
the work of IZabludoff fe Mulchaevl (|20Q0h . who find that the 
DGR decreases with halocentric radius albeit in a sample of 
lower masses and smaller red shifts than probed in this work. 
IZabludoff fe Mulchaevl <|2000h propose a number of scenarios 
to explain the behaviour of the DGR with radius, particu- 
larly that brighter and more massive galaxies are subject 
to larger amounts of dynamical friction and thus more fre- 
quently merge with the BCG than their fainter counterparts. 
This effect leads to a deficiency of bright, massive galaxies 
in the centre of the halo. Our results would therefore sug- 
gest that such a mechanism may occur in the high mass 
end of clusters as well as in t he poorer groups looked at by 
IZabludoff fe Mulchaevl (|2000h . 

5.3.4 Satellite colours 

A split of the satellite profile stacks into red and blue satel- 
lites (done about the median colouiF^l of g — r^ 0.72) yields 
a striking difference between their profile shapes. An im- 
pressive deficit[3 of blue satellites in the centre of the halo 
relative to red satellites is consistent with what is found by 
ICollister fc Lahavl (2005), who observe that the profiles of 
red galaxies (c = 3.9) are much more concentrated than 
those of blue galaxies (c = 1.3) in a large sa mple of 2PIGG 
group s. This conclusion is echoed by Weinmann et al.l (|2006l 
120091 ) who find that the early-typqij fraction of galaxies de- 
creases with increasing halocentric radius, and vice- versa for 
late- types. 

A couple of mechanisms commonly invoked to ex- 
plain the red/blue behaviour are ram-press ure stripping 
and strangulation. Ram-pressure stripping (|Gunn &; Gottl 
Il972h causes rapid removal of cold gas, and strangulation 
(Balo eh et aD 2000) occurs when satellites are deprived of 
their hot gas reservoir. Both processes will result in a su- 
pression of star formation in the satellite galaxies and can 
transform late- type galaxies (blue) into early- types (red). 
These processes which quench star- format ion become more 

12 k-corrected to redshift zero. 

13 It is worth noting that the slight rise at ~ O.lrsoo in the 
profile of blue galaxies could be an artifact possibly caused by the 
differential effects of BCG obscuration for the red/blue galaxies. 

14 'Early/late- type' is a definition based on the star-formation 
rate of the galaxy, but observationally these are very strongly 
correlated with red/blue colour of galaxies. 
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Figure 9. Mean satellite number density profiles split into bins of cluster redshift, BCG luminosity, BCG luminosity fraction and satellite 
magnitude limit. The black points show the representative scatter between profiles within each stack, and the shaded grey region shows 
the mean error bar due to Poisson scatter as a function of radius. All horizontal axes have been scaled by the mean rsoo m a given bin 
to remove the mass dependence. The shaded (dotted) regions represent areas of incompleteness due to BCG obscuration. Top Left (a): 
Cluster sample is split by redshift. Top Right (b): The sample is split according to the mean BCG magnitude. Bottom Left (c): The 
sample (limited to z < 0.3) split by absolute magnitude limit of the satellites. There profiles have been arbitrarily normalised further 
out to ~ 3r2oo- Bottom Right (d): Sample split by satellite colour (i.e. g — r brighter and fainter that 0.72). 



severe as we move towards the centre of the halo, which ex- 
plains the large observed discrepancy between red and blue 
galaxies within rsoo- 



6 SEMI- ANALYTIC MODELS 

We can attempt to shed some light on the observed be- 
haviour of the satellites in clusters (Section [5]), by compar- 
ing our results with predictions from semi-analytic models of 
galaxy formation. In partic ular, we extr a ct pro j ected num- 
ber de nsit y profiles for the iBower et al.l (200d). lFont et al.l 
( 2008|) and lDe Lucia fc Blaizotl (|2007T ) models using the Mil- 



lennium Simulation SQL databasJ^l. We select all clusters 
from z — 0.32 snapshot for which logMsoo ^ 13.7, yield- 
ing a sample of over 3000 simulated clusters. To mimic the 
procedure used to derive the observed profiles, we extract 
galaxies within a 5 Mpc aperture centered on the position 
of the most bound particle (which should correspond to the 
location of the BCG), perform a statistical background sub- 
traction using a randomly placed 5 Mpc aperture placed, 
and retain only galaxies with a absolute rest-frame r-band 
magnitude > —20.5. 

In Fig [10] we show the stacked satellite profiles in four 

15 http://virgo.dur.ac.uk/ 
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Figure 10. Comparison between the observed satellite number density profile (solid black dots) and those from a series of semi-analytic 
models of galaxy formation (colored curves). The black solid line is a stack of all BCG clusters in the sample. The black error bars 
show the representative scatter between profiles within each stack, and the shaded grey region s hows the mean error bar due to Poisson 
scatter as a function of radius. The shaded (dotted) regions repre sent areas of incompleteness due to B CG obscuration. Panels (a),(b) , 
and (c) correspond to the semi-analytic model satellites profi les oflBower et al.l (|2006[) . | Font et a l. (2008) and lDe Lucia & Blaizot (2007) 
respectively. Panel (d) shows the model satellites profiles of iDe Lucia fc Blaizotl (j2007|) when only satellites with existing dark matter 
subhaloes are included. 



mass bins (the same ones as in Fig. [9} for the 3 models, along 
with a single stacked observational profile comprised of all 
the clusters in the sampkE! We will discuss the bottom 
right panel below. 

Underlying the bower et al.l (j2006T ). iFont et all <|2008h 
and lDe Lucia &; Blaizotl ( 2007 ) models are the same friends- 
of-friends (FoF) and substructure catalogs, which were pro- 
duced by running the Subfind algorithm (Springel et al. 
2001) on the Millennium Simulation. There are some slight 



16 The limited dependence of observed satellite concentration on 
halo mass in Fig [8] implies that we can reasonably combine all 
clusters in the sample into a single universal observational profile 
for comparison with the models. 



differences in the merger trees constructed from these cat- 
alogs by the Durham and Munich groups and significant 
differences in the implementation of baryonic physics (such 
as radiative cooling, stellar evolution, supernova feedback, 
AGN feedback and so on). In principle the models could 
therefore produce quite different results. However, it should 
be borne in mind that all three models have been tuned to 
match the global galaxy luminosity function. What this tun- 
ing effectively achieves is to assign approximately the correct 
amount of luminosity /stellar mass to the central galaxy of a 
given dark matter halo (i.e., similar to what is achieved by 
direct 'abundance matching', but without explicitly impos- 
ing a monatonic relation between luminosity /stellar mass 
and halo mass or a constant amount of scatter at fixed halo 
mass). Our expectation, therefore, is that all three mod- 
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els ought to be similar at large radii and should, at least 
roughly, reproduce the observed unsealed satellite profiles 
there, where the effects of tidal evolution and dynamical 
friction should be minimal. Note that the model and ob- 
served scaled radial profiles will only match in normalisa- 
tion at large radii if our estimates of rsoo are accurate, since 
we are using empirical estimates of rsoo for the observed 
clusters and the true values of rsoo for the simulated sys- 
tems. As can be seen in Fig [10] all three models have similar 
behaviour at large radii and approximately match the ob- 
served profiles both in terms of shape and (encouragingly) 
normalisation. There is also a lack of a strong halo mass 
dependence in shape/normalisation of the profiles from the 
models, exactly as observed. 

Within rsoo difference s between the models becom e ap- 
parent. In particular, the iDe Lucia &; Blaizotl (|2QQ7h pro- 
files become noticeably flatter, in rough accordance with 
the observ e d pro files, whereas the iBower et al.l ([2006) and 
iFont et all (|2008h profiles show a smaller degree of flatten- 
ing. At first sight, this may seem suprising, as all three mod- 
els use the FoF and substructure catalogs. But note the dif- 
ference between the models originates from differences in the 
treatment of satellite galaxies when the subhalo to which 
that galaxy belonged is no longer identified by Subfind . 
Subhaloes can be 'lost' either because they have been com- 
pletely tidally disrupted or because the substructure finding 
algorithm has failed to fi nd them. In FigllOl fd ) we sh ow the 
profiles derived from the IDe Lucia &; Blaizotl (|2QQ7h model 
when only satellite galaxies with still existing subhaloes are 
included. This is equivalent to assuming that whenever a 
subhalo is lost the galaxy within it is completely destroyed. 
Here we see a very strong break in the profiles within rsoo, 
which is in obvious discord with the observations. The inter- 
esting implication of this experiment is that tidal disruption 
of satellites galaxies is much less efficient than the disruption 
of their surrounding dark matter subhaloJ^l 

In all three models when a satellite galaxy loses its dark 
matter subhalo the position of the satellite galaxy is assigned 
by using the current position of the most bound dark mat- 
ter particle of the subhalo at the time the subhalo was last 
identified. This procedure cannot be followed indefinitely, 
of course, as no merging would take place with the central 
galaxy, since a single dark matter particle does not experi- 
ence dynamical friction and therefore will not sink to the 
center. Therefore, what is done is to calculate a dynami- 
cal friction merger timescale, r m er g e, for the satellite galaxy. 
This timescale is calculated differently in the Durham and 
Munich models. The Durham models use a timescale that 
i s pro porti onal to the origin al formation of IChandrasekhar] 
(|l943h fsee lCole et alJ feOQO for details) and depends on the 
main halo mass, the mass of the satellite, and orbital en- 
ergy and angular momentum of the satellite. We note here 
that the mass of the satellite that is used is the total mass 
(gas+stars+dark matter) at the time of virial crossing of 
the main halo, rather than the time the subhalo was last 
identified. Also, the distribution of initial orbital param- 
eters of infalling satellites (at virial crossin g) is adopted 
from a fit to the cosmological simualtions of iTormenl fl997) 



17 Here we are making the assumption that the substructure find- 
ers are able to find most self- gravitating dark matter haloes. 



rather than using the orbital parameters of the satellites in 
the Millennium Simulation (we note, however, that a newer 
version of the GALFORM code allows for this possibility). 
The constant of proportionality, which is of order unity, is 
treated as a free parameter and varied to improve the match 
to the break in th e gal axy luminos i ty f unction. Both the 
IBower et all (|20Q6h and IFont et al.l <|2008h models use the 
same constant of proportionality. The Munich model, by 
contrast, uses the properties of the satellite (i.e., its mass 
and distance from the center) at the time its subhalo was 
last identified, which should be better, but their timescale 
calculation lacks any dependence on the orbital parameters, 
which should be relevant. There is also a constant of pro- 
port ionahty_jxi_t^ time scale in the Munich model, 
which IDe Lucia fe Blaizotl <|2007h have tuned to improve the 
match to the break in the luminosity function. 

Once a satellite galaxy has been in orbit for a time equal 
to the merger timescald 18 ! the satellite is removed from the 
catalog and its mass is added to the central galaxy. Note 
that none of the models take into account mass loss due to 
tidal forces over the course of this merger timescale. 

The larger degree of flattening of the profiles from the 
IDe Lucia &; Blaizotl (|2007h model would suggest a generally 
shor ter merger timesca le tha n that which is be ing adopted in 
the IBower etaD (|2006h and lFont et all (|2008h . Why this is 
the case is difficult to ascertain, given the large differences in 
the wa y in which these st udies calculate this timesca l e. Re - 
cently. " I Jiang et al.1 (j2008T ) and iBovlan-Kolchin et all (|20Q8h 
used detailed numerical simulations to show that the current 
implementations of the dynamical friction merger timescale 
in the semi-analytic models do not accurately reproduce the 
true merger timescale from the simulations, in that they tend 
to underestimate the timescale for low mass satellites and 
overestimate it for massive satellites. This is likely due to a 
failure of a nu mber of assum ptions, including the neglect of 
mass loss (see Benson (2010) for further discussion). 

In reality, the flattening of the satellites radial profiles 
is likely to be due to a combination of merging with the 
central galaxy and tidal mass loss of the satellites as they 
orbit about cluster. The fact that galaxy clusters contain 
detectable amounts of intra-cluster lig ht (ICL) (see, e.g., 
iGonzalez et al.ll2005l : IZibetti et al.l 120051 ) demonstrates that 
the latter mechanism has to be taking place as well. Disen- 
tangling these two processes relies upon understanding the 
complex interplay between the satellite profiles, the BCG 
luminosities and the ICL. In a forthcoming study (Koposov 
et al, in preparation) we will present self- consistent measure 
of ICL for the cluster sample presented here. In principle 
this should allow us to take a large jump towards under- 
standing the complicated interplay between tidal stripping 
and satellite merging. 



7 CONCLUSIONS 

We have used the power of both spectroscopic and pho- 
tometric datasets published as part of the SDSS DR7 to 

18 Note the 'clock' starts ticking at different times for the 
Durham and Munich models. It starts at virial crossing for the 
Durham models and at the time the subhalo was last identified 
for the Munich model. 
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measure the satellite profiles of clusters with Luminous Red 
Galaxies (LRGs) at their centers. We take advantage of the 
large volume of the SDSS to probe a substantial range in 
halo mass (10 13 7 < M 50 o < 10 15 M©), and utilize the 
SDSS spectroscopic BCG sample to probe clusters out to a 
reasonably high redshift (z = 0.4). 

Using the "anchor" sample of SDSS clusters with ac- 
curate X-ray temperature measurements we link the optical 
richness with the cluster's extent and total mass rsoo , wisoo • 
Armed with this robust measurement of halo mass, we 
have constructed a large catalogue of ~ 20 000 BCG clus- 
ters, which is highly pure and complete at halo masses of 
M500 > 10 14 Mq. The estimates of cluster mass are used to 
partition clusters in four bins according to the total amount 
of dark matter. In each mass bin, high signal-to-noise satel- 
lite profile stacks are produced. These reveal that accross 
all masses, the satellites are systematically less concentrated 
than the dark matter, roughly by a factor of two. Interest- 
ingly, in spite of the difference in shape between the galaxy 
and DM radial distributions, both exhibit a high degree of 
self- similarity (i.e. no strong change in shape of the profile 
as a function of mass). We find a strong evolution in the 
concentration of the satellite profile as a function of satel- 
lite brightness and colour and discuss physical mechanisms 
to explain this observed behaviour. We find only very weak 
dependencies of the satellite radial distribution on cluster 
redshift (over the range 0.15-0.4) or BCG luminosity. 

We made a self-consistent comparison with a number of 
recent semi- analytic models of galaxy formation. We showed 
that: (1) beyond approx. 0.3rsoo current models are able to 
reproduce both the shape and normalisation of the satel- 
lite profiles; and (2) within 0.3rsoo the predicted profiles are 
sensitive to the details of the satellite-BCG merger timescale 
calculation. The former is a direct result of the models being 
tuned to match the global galaxy luminosity function com- 
bined with the assumption that the satellite galaxies do not 
suffer significant tidal stripping, even though their surround- 
ing DM haloes can be removed through this process. The 
deviation within ^0.3 r500 implies that the semi-analytic 
models have too long of a merger timescale and/or tidal 
stripping of the stellar component of galaxies, prior to merg- 
ing with the BCG, is important (the effects of tidal stripping 
are not taken into account in the models that we have con- 
sidered). The fact that groups and clusters demonstrably 
have non-neglibible stellar mass in a diffuse component (the 
ICL) indeed suggests that the latter process is relevant. In a 
forthcoming study we will present stacked measurements of 
the ICL for the cluster sample presented here. When com- 
bined with our measurements of the satellite population and 
the BCG it should be possible to gain a more complete un- 
derstanding of the evolution/fate of the satellite population 
and could serve as a means to inform theoretical models on 
the efficacy of the tidal stripping and merging processes. 
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APPENDIX A: LINEAR MODEL FITTING 

This describes the formalism from Hog g et al ] <|2010h for the 
fitting of a straight line model with slope m and intercept 
b, to data with two dimensional uncertainties. 
The orthogonal displacement A* of each data point from the 
line is given by: 



b cos t 



(Al) 



where {pa, yi) is the individual data point, and v is the unit 
vector orthogonal to the line given by: 



(A2) 



—m 




— sin# 




1 




cos 





Each data points covariance matrix is given by: 
S? = v T S t v , 



(A3) 



where the covariance matrix of horizontal and vertical mea- 
surement uncertainties corresponds to: 

S, = a * 1 a T 1 . (A4) 



APPENDIX B: MAXBCG RADIAL PROFILES 



Fig. lBll shows the raw unsealed satell ite number den sity pro- 
files for the MaxBCG clusters (|Koester et alJl2QQ7h , in four 
mass bins. The profiles are very similar to those generated 
using the cluster sample described in this paper (Fig. [5]). 



APPENDIX C: ERROR MODELLING 

In order to estimate the scatter in number density profiles 
we perform the following analysis. As described in the text 
we are interested in the scatter of the scaled surface density: 
H ga i = I/R500 ^o, 9 ai as a function of log(R/R 5 oo). Our data 
consist of the measurements of the number of galaxies in 
circular annuli for different clusters and the corresponding 
measurements of the background galaxy density. 

First we consider the expected number of galaxies in 
the i-th bin of log(R/Rsoo) for the galaxy cluster j: 



expected, i,j 



'■ Aij {R5OOJ ^i,j + ^bg,j) 



(CI) 



where Aij is the area of the i-th annulus for j-th cluster, 
R500J is the R500 of the j-th cluster, E^j is a scaled den- 
sity of galaxies in the cluster and ^b g ,j is the background 
galaxy density. In the simplest scenario we would assume 
that Ei,j = Ei is constant for all clusters, but in this case 
we will assume that the T>ij is Gaussian distributed with 
the mean E« and dispersion Si. 

Now we can write the likelihood of the parameters 
(Ej , Si, Tib g ,ijTiij) given observed Nij galaxies in the i-th 
bin of the j-th cluster and Nb g ,i background galaxies ob- 
served in the random 5Mpc radius field: 



£3 5 $>i , ^bg,i ? \NiJ , Nbg,i) — 
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Figure Bl. The mean satellite number density profiles as a func- 
tion of radius for stacks of MaxBCG cluster satellites in bins of 
halo mass. The black points show the representative mean error 
bar on the profiles as a function of radius. It is clear that the 
higher mass clusters have a larger number of satellites than the 
lower mass bins. 



where the first term is Poisson probability of observing the 
Nij objects given a certain density of galaxies (including 
background), the second term is the probability of having 
Ei,j given the mean density E« and scatter Si, and the third 
term is essentially is Gaussian probability of the background 
measurement from the random 5 Mpc radius field. In our 
case we are not interested in the values of (Ei ;J -, E&^i), so 
their are nuisance parameters for us and we can marginalize 
over them. Then the likelihood becomes only a function of 
E», Si'. 



(Ei , Si 



Ej, Si, Ylbg,i, E^j) dYlbg,i dYlij . (G3) 



The full likelihood for the i-th bin of the profile is then 
obtained by multiplying the likelihoods for different clusters 



£i(Ei, Si) — J^J £>ij (Ej, Si) . 



(C4) 



We then maximize that likelihood with respect to the 
{Hi, Si) parameters for each radial bin, which gives as es- 
timates of the mean and a scatter of the density profiles. 
For the error estimates of the parameters we use Hessian 
matrices numerically evaluated at the best-fit parameters. 



P{Ni,j\N expected ^j) P{Z t ,j\Zi, S t ) P{N bg , t \H 



bg,i) 



(C2) 
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